138 research outputs found

    The genome sequence of Trypanosoma brucei gambiense, causative agent of chronic Human African Trypanosomiasis

    Background: Trypanosoma brucei gambiense is the causative agent of chronic Human African Trypanosomiasis or sleeping sickness, a disease endemic across often poor and rural areas of Western and Central Africa. We have previously published the genome sequence of a T. b. brucei isolate, and have now employed a comparative genomics approach to understand the scale of genomic variation between T. b. gambiense and the reference genome. We sought to identify features that were uniquely associated with T. b. gambiense and its ability to infect humans.
    Methods and findings: An improved high-quality draft genome sequence for the group 1 T. b. gambiense DAL 972 isolate was produced using a whole-genome shotgun strategy. Comparison with T. b. brucei showed that sequence identity averages 99.2% in coding regions, and gene order is largely collinear. However, variation associated with segmental duplications and tandem gene arrays suggests some reduction of functional repertoire in T. b. gambiense DAL 972. A comparison of the variant surface glycoproteins (VSG) in T. b. brucei with all T. b. gambiense sequence reads showed that the essential structural repertoire of VSG domains is conserved across T. brucei.
    Conclusions: This study provides the first estimate of intraspecific genomic variation within T. brucei, and so has important consequences for future population genomics studies. We have shown that the T. b. gambiense genome corresponds closely with the reference, which should therefore be an effective scaffold for any T. brucei genome sequence data. As VSG repertoire is also well conserved, it may be feasible to describe the total diversity of variant antigens. While we describe several as yet uncharacterized gene families with predicted cell surface roles that were expanded in number in T. b. brucei, no T. b. gambiense-specific gene was identified outside of the subtelomeres that could explain the ability to infect humans.

    No observed effect of homologous recombination on influenza C virus evolution

    The occurrence of homologous recombination in influenza viruses has been under some debate recently. To determine the extent of homologous recombination in influenza C virus, recombination analyses of all available gene sequences of influenza C virus were carried out. No recombination signal was found. Together with the previous evidence from influenza A and B viruses, this suggests that homologous recombination has minimal or no effect on influenza virus evolution.

    Telomeric expression sites are highly conserved in Trypanosoma brucei

    Subtelomeric regions are often under-represented in genome sequences of eukaryotes. One of the best known examples of the use of telomere proximity for adaptive purposes is the bloodstream expression sites (BESs) of the African trypanosome Trypanosoma brucei. To enhance our understanding of BES structure and function in host adaptation and immune evasion, the BES repertoire from the Lister 427 strain of T. brucei was independently tagged and sequenced. BESs are polymorphic in size and structure but reveal a surprisingly conserved architecture in the context of extensive recombination. Very small BESs do exist and many functioning BESs do not contain the full complement of expression site associated genes (ESAGs). The consequences of duplicated or missing ESAGs, including ESAG9, a newly named ESAG12, and additional variant surface glycoprotein genes (VSGs), were evaluated by functional assays after BESs were tagged with a drug-resistance gene. Phylogenetic analysis of constituent ESAG families suggests that BESs are sequence mosaics and that extensive recombination has shaped the evolution of the BES repertoire. This work opens important perspectives in understanding the molecular mechanisms of antigenic variation, a widely used strategy for immune evasion in pathogens, and telomere biology.

    OrgConv: detection of gene conversion using consensus sequences and its application in plant mitochondrial and chloroplast homologs

    Background: The ancestry of mitochondria and chloroplasts traces back to separate endosymbioses of once free-living bacteria. The highly reduced genomes of these two organelles therefore contain very distant homologs that only recently have been shown to recombine inside the mitochondrial genome. Detection of gene conversion between mitochondrial and chloroplast homologs was previously impossible due to the lack of suitable computer programs. Recently, I developed a novel method and have, for the first time, discovered recurrent gene conversion between chloroplast and mitochondrial genes. The method will further our understanding of plant organellar genome evolution and help identify and remove gene regions with incongruent phylogenetic signals for several genes widely used in plant systematics. Here, I implement this method and make it available through a user-friendly web interface.
    Results: OrgConv (Organellar Conversion) is a computer package developed for detection of gene conversion between mitochondrial and chloroplast homologous genes. OrgConv is available in two forms: source code that can be installed and run on a Linux platform, and a web interface that is available on multiple operating systems. The input files of the feature program are two multiple sequence alignments from different organellar compartments in FASTA format. The program compares every examined sequence against the consensus sequence of each sequence alignment rather than exhaustively examining every possible combination. Making use of consensus sequences significantly reduces the number of comparisons and therefore reduces overall computational time, which allows for analysis of very large datasets. Most importantly, with the significantly reduced number of comparisons, the statistical power remains high in the face of correction for multiple tests.
    Conclusions: Both the source code and the web interface of OrgConv are available for free from the OrgConv website, http://www.indiana.edu/~orgconv. Although OrgConv has been developed with a main focus on detection of gene conversion between mitochondrial and chloroplast genes, it may also be used for detection of gene conversion between any two distinct groups of homologous sequences.
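
    The comparison-count argument in the abstract above can be illustrated with a minimal Python sketch. This is not OrgConv's code: the function names `consensus` and `comparison_counts`, the majority-rule consensus, and the reading of "every possible combination" as every mitochondrial-chloroplast sequence pair are all assumptions made for illustration.

```python
from collections import Counter


def consensus(alignment):
    """Majority-rule consensus of equal-length aligned sequences.

    Assumption: ties are broken in favour of the base seen first in the column.
    """
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*alignment))


def comparison_counts(n_mito, n_cp):
    """Return (exhaustive, consensus_based) comparison counts for two alignments.

    exhaustive: every mitochondrial sequence against every chloroplast sequence.
    consensus_based: every sequence against the consensus of each of the two
    alignments, following the wording of the abstract.
    """
    exhaustive = n_mito * n_cp
    consensus_based = 2 * (n_mito + n_cp)
    return exhaustive, consensus_based
```

    Under these assumptions, two alignments of 100 sequences each would need 10,000 pairwise comparisons but only 400 consensus-based ones, so a multiple-testing correction has far fewer tests to account for.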

    Origin of the Diversity in DNA Recognition Domains in Phasevarion Associated modA Genes of Pathogenic Neisseria and Haemophilus influenzae

    Phase variable restriction-modification (R-M) systems have been identified in a range of pathogenic bacteria. In some it has been demonstrated that the random switching of the mod (DNA methyltransferase) gene mediates the coordinated expression of multiple genes and constitutes a phasevarion (phase variable regulon). ModA of Neisseria and Haemophilus influenzae contains a highly variable DNA recognition domain (DRD) that defines the target sequence that is modified by methylation and is used to define modA alleles. Eighteen distinct modA alleles have been identified in H. influenzae and the pathogenic Neisseria. To determine the origin of DRD variability, the 18 modA DRDs were used to search the available databases for similar sequences. Significant matches were identified between several modA alleles and mod genes from distinct bacterial species, indicating that one source of the DRD variability was horizontal gene transfer. Comparison of DRD sequences revealed significant mosaicism, indicating exchange between the Neisseria and H. influenzae modA alleles. Regions of high inter- and intra-allele similarity indicate that some modA alleles had undergone recombination more frequently than others, generating further diversity. Furthermore, the DRDs from some modA alleles, such as modA12, have been transferred en bloc to replace the DRDs from different modA alleles.

    Diversity of Prophage DNA Regions of Streptococcus agalactiae Clonal Lineages from Adults and Neonates with Invasive Infectious Disease

    The phylogenetic position and prophage DNA content of the genomes of 142 S. agalactiae (group-B streptococcus, GBS) isolates responsible for bacteremia and meningitis in adults and neonates were studied and compared. The distribution of the invasive isolates between the various serotypes, sequence types (STs) and clonal complexes (CCs) differed significantly between adult and neonatal isolates. Use of the neighbor-net algorithm with the PHI test revealed evidence for recombination in the population studied (PHI, P = 2.01×10⁻⁶), and the recombination-mutation ratio (R/M) was 6:7. Nevertheless, the estimated R/M ratio differed between CCs. Analysis of the prophage DNA regions of the genomes of the isolates assigned 90% of the isolates to five major prophage DNA groups: A to E. The mean number of prophage DNA fragments amplified per isolate varied from 2.6 for the isolates of prophage DNA group E to 4.0 for the isolates of prophage DNA group C. The isolates from adults and neonates with invasive diseases were distributed differently between the various prophage DNA groups (P<0.00001). Group C prophage DNA fragments were found in 52% of adult invasive isolates, whereas 74% of neonatal invasive isolates had prophage DNA fragments of groups A and B. Differences in prophage DNA content were also found between serotypes, STs and CCs (P<0.00001). All the ST-1 and CC1 isolates, mostly of serotype V, belonged to the prophage DNA group C, whereas 84% of the ST-17 and CC17 isolates, all of serotype III, belonged to prophage DNA groups A and B. These data indicate that the transduction mechanisms, i.e., gene transfer from one bacterium to another by a bacteriophage, underlying genetic recombination in S. agalactiae species, are specific to each intraspecies lineage and population of strains responsible for invasive diseases in adults and neonates.

    Dynamics of Molecular Evolution and Phylogeography of Barley yellow dwarf virus-PAV

    Barley yellow dwarf virus (BYDV) species PAV occurs frequently in irrigated wheat fields worldwide and can be efficiently transmitted by aphids. Isolates of BYDV-PAV from different countries show great divergence both in genomic sequences and pathogenicity. Despite its economic importance, the genetic structure of natural BYDV-PAV populations, as well as the mechanisms maintaining its high diversity, remain poorly explored. In this study, we investigate the dynamics of BYDV-PAV genome evolution utilizing time-structured data sets of complete genomic sequences from 58 isolates from different hosts obtained worldwide. First, we observed that BYDV-PAV exhibits a high frequency of homologous recombination. Second, our analysis revealed that the BYDV-PAV genome evolves under purifying selection and at a substitution rate similar to other RNA viruses (3.158×10⁻⁴ nucleotide substitutions/site/year). Phylogeography analyses show that the diversification of BYDV-PAV can be explained by local geographic adaptation as well as by host-driven adaptation. These results increase our understanding of the diversity, molecular evolutionary characteristics and epidemiological properties of an economically important plant RNA virus.

    High Density Microarray Analysis Reveals New Insights into Genetic Footprints of Listeria monocytogenes Strains Involved in Listeriosis Outbreaks

    Listeria monocytogenes, a foodborne bacterial pathogen, causes invasive and febrile gastroenteritis forms of listeriosis in humans. Both forms are caused mostly by strains of serotypes 1/2a, 1/2b and 4b. The outbreak strains of serotypes 1/2a and 4b can be further classified into several epidemic clones, but attempts to identify the genetic bases for their diverse pathophysiology have been unsuccessful. DNA microarrays provide an important tool to scan the entire genome for genetic signatures that may distinguish the L. monocytogenes strains belonging to different outbreaks. We have designed a pan-genomic microarray chip (Listeria GeneChip) containing sequences from 24 L. monocytogenes strains. The chip was designed to identify the presence/absence of genomic sequences, analyze transcription profiles and identify SNPs. Analysis of the genomic profiles of 38 outbreak strains representing 1/2a, 1/2b and 4b serotypes revealed that the strains formed distinct genetic clusters adhering to their serotypes and epidemic clone types. Although serologically 1/2a and 1/2b strains share common antigenic markers, microarray analysis revealed that 1/2a strains are further apart from the closely related 1/2b and 4b strains. Within any given serotype and epidemic clone type, the febrile gastroenteritis and invasive strains can be further distinguished based on several genetic markers, including large numbers of phage genome and intergenic sequences. Our results showed that microarray-based data can be an important tool in the characterization of L. monocytogenes strains involved in both invasive and gastroenteritis outbreaks. The results showed for the first time that the serotypes and epidemic clones are based on extensive pan-genomic variability and that the 1/2b and 4b strains are more closely related to each other than to the 1/2a strains. The data also supported the hypothesis that the strains causing these two diverse outbreaks are genotypically different, and this finding might be important in understanding the pathophysiology of this organism.

    A Comparison of rpoB and 16S rRNA as Markers in Pyrosequencing Studies of Bacterial Diversity

    Background: The 16S rRNA gene is the gold standard in molecular surveys of bacterial and archaeal diversity, but it has the disadvantages that it is often multiple-copy, has little resolution below the species level and cannot be readily interpreted in an evolutionary framework. We compared the 16S rRNA marker with the single-copy, protein-coding rpoB marker by amplifying and sequencing both from a single soil sample. Because the higher genetic resolution of the rpoB gene prohibits its use as a universal marker, we employed consensus-degenerate primers targeting the Proteobacteria.
    Methodology/Principal Findings: Pyrosequencing can be problematic because of the poor resolution of homopolymer runs. As these erroneous runs disrupt the reading frame of protein-coding sequences, removal of sequences containing nonsense mutations was found to be a valuable filter in addition to flowgram-based denoising. Although both markers gave similar estimates of total diversity, the rpoB marker revealed more species, requiring an order of magnitude fewer reads to obtain 90% of the true diversity. The application of population genetic methods was demonstrated on a particularly abundant sequence cluster.
    Conclusions/Significance: The rpoB marker can be a complement to the 16S rRNA marker for high throughput microbial diversity studies focusing on specific taxonomic groups. Additional error filtering is possible and tests for recombination or selection can be employed.
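
    The nonsense-mutation filter described in the abstract above can be sketched in a few lines of Python. This is an illustration only, not the authors' pipeline: the names `has_inframe_stop` and `filter_reads` are hypothetical, and the sketch assumes each read has already been trimmed so that its reading frame is known.

```python
# Stop codons of the standard genetic code (the bacterial code uses the same three).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_inframe_stop(read, frame=0):
    """Return True if any complete codon in the given frame is a stop codon."""
    codons = (read[i:i + 3].upper() for i in range(frame, len(read) - 2, 3))
    return any(codon in STOP_CODONS for codon in codons)


def filter_reads(reads, frame=0):
    """Keep only reads without an in-frame stop codon (assumed frame: `frame`)."""
    return [read for read in reads if not has_inframe_stop(read, frame)]
```

    A homopolymer miscall that inserts or deletes a base shifts the frame downstream, which tends to introduce a premature stop, so reads rejected by a filter of this kind are likely to carry such errors. Out-of-frame occurrences of TAA, TAG or TGA are ignored.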